ON THE CAPACITY REGION OF MULTIPLE ACCESS ADDER 
Q-. CHANNEL 
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D ' Abstract. We compute the capacity region of the s-user Multiple Access 

C/j , Adder Channel and obtain an explicit description of this region as only one 

polyhedron, instead of a convex hull of infinitely many polyhedrons. We obtain 
this by proving a conjecture raised by Lindstrom in 1975, which in turn follows 
from certain convexity results, both continuous and discrete. 
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1. Preliminaries 
1.1. Multiple Access Channel. 

Definition 1. An s-user Multiple Access Channel (MAC) consists of s input al- 
phabets, Xi, X2, . . • , X s , an output alphabet y , and a probability transition matrix 
P(y I a?i, %2i ■ ■ ■ ,x s ), which defines the probability of receiving the output symbol 

y given input symbols x%, x 2 , ■ ■ ■ ,x s . 

G\ ' 

O 

Definition 2. An (Mi, M 2 , . . . , M s ; n) code for the s-user MAC consists of s en- 
coding functions ei, . . . , e s which encode each message m^ G [MA = {1,2,... , MA 

-<— > 

a 

0: e f : [Mi] - X? 

>! mi ^ e%(mi) = (ef\m),ei (mA,... ,ef\m))', 

r^ ' arid a decoding function: 



d : ^ n -> [M x ] x [M 2 ] x • • • x [M a ] 

(yi, ... ,j/ n ) i-f (mi,... ,m s ). 

The model of communication is as follows (see Fig. 1). Each user i chooses a 
label rrti uniformly from its message set [Mj] and sends the codeword ej(mj). All 
the users send their codewords simultaneously, symbol by symbol. After n trans- 
missions, a sequence of n output symbols (2/1,2/2, • • • , Vn) is received according to 
the transition probability matrix. This is then decoded by the decoding function 
d into a set of messages (mi, . . . , rh s ). If the decoded messages are identical to 
the messages sent, the communication has succeeded. Otherwise, we say an error 
has occurred. 
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Fig. 1. Multiple Access Channel 



Definition 3. TTie average probability of error for an (Mi, M 2 , . . . , M s ; n) code 

is 

1 



p(«) 



M 1 M 2 ---M i 



^ P r ( rf(^ n ) 7^ (mi, ... , m s ) | (mi, . . . , m s ) seni). 



(mi, ... ,m 3 ) 

G [Mi] X ■ ■ ■ X [M s ] 



We use raie to measure the efficiency of an (Mi, . . . , M s ; n) code. 

Definition 4. The rate tuple (Ri, . . . , R s ) of an (Mi, . . . , M s ; n) code is defined 

as 

Ri = — - — - bits per transmission in the i-th sender, 
n 

Definition 5. A rate tuple (R\, R 2 , • • • , R s ) is achievable if there exists a sequence 
of (2 nR \2 nR2 , ... , 2 nR °;n) codes with P e (n) ^Oasn^oo. 

Definition 6. The capacity region of the MAC is the closure of the set of all the 
achievable rate tuples. 



Definition 7. A Multiple Access Channel is called Multiple Access Adder Channel 
(MA AC) when the input alphabets are X\ — X 2 — . . . — X s — {0, 1} and the 
channel output is the sum of the s input symbols: Y = Xi + X 2 + • • • + X s , so 
that the output alphabet is y = {0,1,2, ... ,s}. 

This is a deterministic channel in that a given s-tuple of inputs always yields 
the same output. 



1.2. Capacity Region. A simple characterization of the capacity region for the 
two-user MAC was first presented by Ahlswede [U in 1973. Ulrey generalized 
it to the MAC with more than two users. 

Let A C [s] — {1,2,... , s}, A c denote the complement of A in [s], R(A) = 

^2 ieA Rii X(A) = {Xi : i e A}, and I(X\ Y\Z) the conditional mutual information 
of random variable X and Y given random variable Z. 

Theorem 1 (Ulrey). The capacity region of the s-user Multiple Access Channel 
is the closure of the convex hull of the rate tuples (Rx, R 2 , • • • , R s ) satisfying 

R(A) < I(X(A); Y\X(A C )) for all the subsets A C [s] 

for some product distribution Px(%i)p2(%2) ■ ■ .p s (x s ). 



2. Results 

For the s-user MAAC, we now present a finite description of its capacity region. 

Let B(n,p) be the binomial distribution with n independent trials and prob- 
ability p of a success in each trial, i.e., if X ~ B(n,p), then Pr(X = k) = 
(™)p fc (l —p) n ~ k . Let H n {p) denote the Shannon entropy of B(n,p), i.e., 

n 

Hn(p) = -E (3Ai-pr* • log (O fc a-pr fc ) . 

fc=0 

Now we state our main theorem. 

Theorem 2. The capacity region for the s-user Multiple Access Adder Channel 
is the set 

.1. 

T 



{(R 1 ,R 2 ,...,R S ): R(A)<H lAl (-) for all AC[ S }}. 



This region is only one polyhedron. For example, the capacity region for the 
2-user adder channel is 

{(R 1 ,R 2 ): < R t < 1; < R 2 < 1; < R x + R 2 < 1/2}; 

the capacity region for the 3-user adder channel is 

{(Rx, R 2 , R 3 ) : R X <1;R 2 < 1; R 3 <l;Ri+R 2 < 1/2; R x + R 3 < 1/2; 

R2 + Rs< 1/2; R x + R 2 + R 3 < (3 - | log 3)}. 

A key part of our proof of Theorem [2] is the following result, conjectured by 
Lindstrom M: 

Theorem 3. Let Xx,X 2 ,... , X n be independent random variables = or 1 with 
Pr(Xi = 1) = pi and Pr(Xi = 0) = 1 —p^ where < pi < 1 for i = 1, 2, . . . , n. 



The Shannon entropy of the random variable Xi + X 2 + ■ ■ ■ + X n is then maximal 
when pi = 1/2 for all i — 1,2, . . . , n, i.e., 



max H(Y j X l )=H n (l/2) 



0<p 

8=1 



Remark. JTie result in Theorem [| /ias apparently obtained by Liao || in /iis 
Ph.D. dissertation in 1972, but his proof is never published. Our proof will include 
proving Theorem [|, conjectured in 1976. 

3. Proof of Results 

3.1. Theorem |3] implies Theorem [2|. To compute the capacity region for the 
s-user adder channel, by Theorem [I], we need to maximize 

I(X(A);Y\X(A C )) 

for any A C [s]. 

Since Y = X 1 + X 2 + ■ ■ ■ + X s is determined by X{A) and X(A C ) for all 
AC [s] , the conditional Shannon entropy of Y given X(A) and X(A C ) is zero, 
i.e., H(Y\X(A),X(A c )) = 0, so 

/(X(A);F|X(A C )) = i7(F|X(A c ))-i7(F|X(A),X(A c )) 
= H{Y\X{A C )) 

= ^H(Y\X(A c ) = x)-P r (X(A c )=x) 

X 

= ^#(^X + ^X|X(A c ) = x)-P r (X(A c ) = a;) 

x ieA ieA c 

= ^iJ(^X).P r (X(A c ) = x) 

x ieA 

= HC£x i )-Y,Pr{X{A c ) = x) 

ieA x 

ieA 

So, 

max I(X(A);Y\X(A C ))= max HfVlA 

0<pi,p2,...,p s <l {i£A: 0< Pi <l} *-^ 

ieA 

Thus, if Theorem |3] is true, then H (J2ieA X^j is maximized when p t = 1/2, i e A 
for all A C [s]. But when pj = 1/2, ieA, the random variable X^eA^* has the 
binomial distribution with parameter (\A\, 1/2), i.e., 

max J(X(A) ; r|X(A c ))= max PT(^X) = PT| A |(l/2) 

0<Pi,P2,---,Ps<l {ieA: 0< Pi <l} *-^ 

ieA 

for all AC [s]. This, together with Theorem [I], implies Theorem |2|. 



3.2. Proof of Theorem |3| (Lindstrom's conjecture). Let pi = P r (Xi = 

1),0 < Pi < 1, and qi = 1 - p u for i = 1,2, ... ,n. Let p = (pi,p 2 ,... ,Pn), 
Pi = (Pi, • • • ,Pi-hPi+i, ■■■ ,Pn), and more generally, for < r < n, Pi u i 2i ...; ir = 
(Ph,Pj2, ■ ■ ■ ,Pj n - r )i wnere 1 < h < h < ■ • ■ < jn-v < n and {j u j 2 , . . . ,j n -r} = 
[n]\{ii,i 2 , ... ,i r }- 
We need the following result of Mateev M . 

Lemma 1 (Mateev). The Shannon entropy of the binomial distribution B(n,p) 
reaches its maximum when p = 1/2, that is, 

max H n {p) = H n {l/2). 

0<p<l 



n 



Let Z p = y^X{. We want to show that the Shannon entropy H(Z P ) of Z p is 

maximized when pi = 1/2 for all i — 1, 2, . . . , n, i.e., 

max H(Z p ) = H n (l/2). 

0<pi,P2v,Pn<l 

Notice that, when pi = p 2 = • ■ ■ = p n — p, Z p has the binomial distribution with 
parameter (n,p), and H(Z P ) = H n (p). By Lemma [l], max < p <i H n (p) = H n {l/2). 
Therefore, to prove Theorem |3|, we only need to show that 

(1) max H(Z P ) = max H(Z p ) 

0<Pl,P2v ,Pn<l 0<pl=P2=-=Pn<l 

We introduce the following lemma, 

Lemma 2. Let p' = (p[, . . . ,p' n ) be any point in R n with < p' x , . . . ,p' n < 1. 

/ m 

Let 1 < I < m < n, and u = (0, . . . , 1, 0, . . . , —1, 0, . . . ,0). Then the function 
H(Z p ) restricted to the line p = p' + t ■ u, where t is any real number such that 
< p < 1, has a unique maximum at p\ = p m . 

Theorem |3] follows immediately from this lemma (which we will prove in the 
next paragraph). Indeed, let p* = (pl,p 2 , ■ ■ ■ ,Pn) De an y maximal point of H (Z p ). 
Then for any 1 < I < m < n, p* is also the maximal point of the function H(Z p ) 
restricted to the line p = p* + t ■ u . By Lemma 0, pi = p* m , for any /, m, which 
proves ( |l|) . 

Proof of Lemma §. We first notice that H(Z P ) is symmetric in pi,p2, 
. . . ,p n , so the function H(Z P ) restricted to the line p = p' + t-u is symmetric inp; 
and p m - We will prove that the function H(Z P ) restricted to the line p = p' + t ■ u 
is concave by showing that the second directional derivative of H(Z P ) in the 
direction u is strictly negative. The concavity guarantees a unique maximum; by 
the symmetry, this maximum occurs at pi = p m . 

Thus, to prove Lemma 0, and hence Theorem ^ and Theorem |2|, we must show: 

DlH(Z p ) < 

V p = (pi,p 2> • • • ,Pn) With 0<Pi,p 2 ,... ,p n < 1, 



where D^ = ^ — -^—. 

u dpi dpm 

The probability distribution of Z p is described by 



y IT Pi 1 [ 1j for < k < n 

b(k, p) = Pr(Z p — k) — { ^w i&A j&A? 

otherwise, 

where A c is the complement of A in [n] . Particularly, 

n n 

j=l i=l 

In general, the probability function Pr(Z p , , , — k) of Z p , , 

/ J l<i<n X{) IS 

{ ^ IT Pi 1 [ 1 3 - for < k < (n — r) 

AC[n]\{i 1 ,...,i r } ieA jeA c 
IAI=fc 
otherwise, 

where A c is the complement of A in [n] \ {ii, 12, ■ ■ ■ , i r }- 
The Shannon entropy of Z p is 



if(Z p ) = -^&(A;,p)log&(A;,p). 



fc=0 



To compute the derivatives of H(Z P ), we will repeatedly use the following formu- 

«|p) = E np , n * - e n*. n * 

Fi leAC[n] i^l j & A c l^BQln] ieB i±l 

\A\=k ieA \B\ = k jeB c 

= e n^ n *■- e n^ n ^ 

A'C[n]\{!} ieA' ie(A') c S'C[n]\{!} jgfi' j£{B'Y 

\A'\=k-l \B'\=k 

= b(k-l,pi) -b(k,p t ) 
= -A k b(k,pj), 

where Aj,f(k) = f(k) — /(&; — 1), the difference operator. On the boundaries 
k — 0, k — n, the above formula still holds: 

^T^ = - II % ■• = ~ b ^ Pf) = b (~^ Pi) " 6 (°> Pf) = - A ^(°' Pi) 

01 / \ 

= IJ Pi = K n - h Pi) = K n - 1, P«) - &("» Pf) = -A k b(n, p f ). 

l<i<n 



<9p; 



Let us compute the second derivatives of b(k, p): 
d 2 b(k, p) d%(k, p) d 

= 6(*"2, Pf, J - &(*-l, Pf,J " 6(*-l, Pf,J + Kk, Pi 
= A26(fc, Pl> ), 

d 2 b(k,p) d 

When fc = and fc = n, we have 
d 2 &(0, P) 
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(b(- 


-I.P1O- 


6(0, P,0 ) = 


<9Pm 


Kj<n 


11 


Qj = 


= &(0»Pf,, 


J 






Kj<r. 


', 










j^l,m 












b{~1 


',Pi>)-26(- 


- 1 .Pfm) + 


6(0, Pi, J 


= A|6(0,PfJ, 


d 

dp m 


( 6(r? 


■-i.Pj) 


- &(«> Pf) 


)=— ( 


n »-°) 

K»<n 



d 2 b(n, p) 

= II Pi = 6 (" " 2 ' Pi,m) 

l<i<n 

= 6(n - 2, Pj> ) - 26(n - 1, Pj> ) + &(n, Pj> ) = A 2 fe &(n, Pj> ). 

Then let us compute the derivatives of H(Z p ): 

dH(Z p ) ^ db(k, p) ^ 1 06(fc,p) 1 
~^ = ~Z^^ log &(fc, P ) ->&(«, P) '—I r-— o r^r 

dpi ^ dp* ^ 6(A;,p) dpi In 2 

a n 

/)/, ^ *" \n2dp~i^ b{k,P) 

fe=0 yL ^ fe=0 



y^ g6(fe, p) 19 A,,, 



^db(k,p) 1 9 

g^^T' log6(A; ' p) "hT2^ 

fc=0 F 



J^ A fc 6(A;, p,-) -log&(A;,p) 



fc=0 



and the second derivatives of H(Z P ): 

d 2 H{Z p ) J^d 2 b(k,p) ^db(k,p) 1 db(k,p) 



dpidp m 


h <w*. b ^' h * w-p) <*■ 




= - ±AIHK p, A ) • logK*. p) - £ A ' 6( * n f £** p *> , 


d 2 H(Z p ) 
dpidpi 


" (A k b(k,p t )) 2 
^ o ln2-b(k,p)- 



The second directional derivative of H(Z P ) in the direction u is 

DlH{z ) _ d 2 H(Z p ) 2 2 H(Z P ) | d 2 H(Z p ) 
P dp 2 dpidp m dp 2 m 

fe=0 V ' ^ y fc=0 

-A A fc 6(fc,p f ) ■ A fc 6(/c,p A ) _ >A (Afc&t^p^)) 2 
^ ln2-6(fc,p) ^ ln2.&(fc,p) 

= 2 £ AjKfc, p,, J ■ log «*, p) - Y, j^^j ■ 

The second term of D\ is clearly negative. We will prove that the first term is 
strictly negative, therefore D 2 i H{Z p ) < will be proved. 
To evaluate the first term, we use "summation by parts" : 



J2(&kf(k))-g(k) = -J2f(k-l)-(A k g(k)) provided f(n) = /(-l) = 0. 

fc=0 k=0 

Noticing that 

b(k, P;^) = when k < or k > n — 2 
A k b(k, Pifa) = when A; < or k > n — 1, 



we can apply summation by parts twice to the first term: 

n 

J2^tb(k,Pi,J-logb(k,p) 



k=0 



-J2 A kb(k-l,p t> J-A k (logb(k,p)) 



k=0 
n 

= Y, b (k~2,p t J.Al(\ogb(k,p)) 

fc=0 

Since b(k, P[^)'s are probabilities, they are non-negative, and at least one of them 
is positive. Finally, we have reduced our proof to the following lemma: 

Lemma 3. b(k, p) is log-concave as a function of k, that is, 

Al(logb(k,p)) < 0, for any < p < 1. 

Proof of Lemma ^. Since 

A 2 k (logb(k,p)) = hgb{k,p)-2logb{k-l,p)+hgb{k-2,p) 

6(fc-2,p)-6(fc,p) 

og (b(k-i, P )y ' 

we only need to show that 

b(k-2,p)-b(k,p)< (6(fc-l, p)) 2 , 

or equivalently, 

b(k-l,p)-b(k + l,p)<(b(k,p)) 2 . 

Let us expand the left-hand side of the inequality: 

b(k-l,p)-b(k + l,p) 

= J2 Up- n ^ Yi n^ n ^ 

AQ[n] a€A a'€A c BC[n] b£B b' eB c 

\A\ = k-l \B\=k+l 

J2 tip'TIp* n ^ n * 

A,sc[n] a£A b€B a'€A c b'<=B c 

\A\ = k-l,\B\ = k+l 

= e nrf n » n <? n *- 

A,BC[n] AnB (AUB)\(AnB) (AUB) C (AnB) c \(AL>B) c 

/l ^ fC — _L , D ^ "H - -t 

b ee e (Esmrfn* n «? n *■■ 

w=0 IC[n] JC[n]\I I J (/UJ) C / C \(IUJ) C 

|/|— w \J\—2k—2w 

Equality (1) may be illustrated by Figure 1. 

9 



J \J\ = 2k-2w 




Figure 1. 



The right-hand side of the inequality can be simplified as: 

= yi H pc n & '-J2 H pd n q* 

CC[n] c£C c'£C c DC[n] dS-D d'eD c 

\C\=k \D\ = k 

= y, tip'TIp* n ^ n ^ 

c,dch C gc deD c'ec c d'eD c 

= TnrfiM + e n*n*iM 



?d' 



C=DC[n] C C c 

|C|=fc 



C^DC[n] C D C c 

\C\=\n=h 



D c 



> 



(2) 



(3) 



e n^n^n^n 



c^dch c D C c 

\C\ = \D=h 

fc-1 



D c 



ee e PEonrfn* n <? n *• 

/ J (/UJ) C I C \(IUJ) C 



10=0 IC[n] JC[n]\I 
J|=ui |J|=2fc— 2to 

fc-1 



ee e G^onrfn* n »i n <&• 

«;=0 rC[n] JC[n]\I / J (iUJ) c 7" C \(/UJ) C 

|/|— it) \J\—2k—2w 

b(k-l,p)-b(k+l,p). 



Equality (2) can be illustrated by Figure 2., and inequality (3) is true since 
( n) > (n-i) ' Therefore Lemma ^ is proved. 
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Figure 2. 
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